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SEQUENTIAL SELECTION OF A MONOTONE SUBSEQUENCE 
FROM A RANDOM PERMUTATION 
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Abstract. We find a two term asymptotic expansion for the optimal expected 
value of a sequentially selected monotone subsequence from a random permu¬ 
tation of length n. A striking feature of this expansion is that tells us that the 
expected value of optimal selection from a random permutation is quantifi- 
ably larger than optimal sequential selection from an independent sequences 
of uniformly distributed random variables; specifically, it is larger by at least 
(1/6) logn + 0(l). 
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1. Sequential Subsequence Problems 

In the classical monotone subsequence problem, one chooses a random permu¬ 
tation TT : [1 : n] —>■ [1 : n], and one considers the length of its longest increasing 
subsequence, 

Ln = max{/c : 7r[ii] < '^[ 12 ] < ■ ■ ■ < T^[ik\ where 1 < *i < *2 • • • < < n}. 

On the other hand, in the sequential monotone subsequence problem one views the 
values 7r[l], 7r[2], ... as though they were presented over time to a decision maker 
who, when shown the value 7r[j] at time i, must decide (once and for all) either to 
accept or reject 7r[i] as element of the selected increasing subsequence. 

The decision to accept or reject 7r[i] at time i is based on just the knowledge of the 
time horizon n and the observed values 7r[l],7r[2],... ,7r[i]. Thus, in slightly more 
formal language, the sequential selection problems amounts to the consideration of 
random variables of the form 

(1) = max{fc : 7r[Ti] < ■k[t 2 ] < • • • < 7r[Tfc] where 1 < n < T 2 • • • < r*, < n}, 

where the indices Ti, i = 1,2,... are stopping times with respect to the increasing 
sequence of cr-fields Tk = cr{7r[l], 7r[2],..., 7r[fc] }, 1 < fc < n. We call a sequence of 
such stopping times a feasible selection strategy, and, if we use r as a shorthand for 
such a strategy, then the quantity of central interest here can be written as 

(2) s(n) = supE[L^], 

T 

where one takes the supremum over all feasible selection strategies. 
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It was conjectured in Baer and Brock (1968) that 

(3) s{n) ^ as n —>■ oo, 

and a proof of this relation was first given in Samuels and Steele (1981). A much 
simpler proof of ([3]) was later given by Gnedin (2000) who made use of a recursion 
that had been used for numerical computations by Baer and Brock (1968). The 
main purpose of this note is to show that by a more sustained investigation of that 
recursion one can obtain a two term expansion. 

Theorem 1 (Sequential Selection from a Random Permutation). For n ^ oo one 
has the asymptotic relation 

(4) s(n) = V^+i logn + 0(1). 

Given what is known for some closely related problems, the explicit second order 
term (logn)/6 presents something of a surprise. For comparison, suppose we con¬ 
sider sequential selection from a sequence of n independently uniformly distributed 
random variables Xi, X 2 , ..., A„. In this problem a feasible selection strategy r is 
again expressed by an increasing sequence of stopping times Tj , j = 1,2,..., but now 
the stopping times are adapted to the increasing ct- fields Fj = tT{Ai, A2, ... , Xj }. 
The analog of o is then 

(5) = maxjfc : X^ < Xr^ < ■ ■ ■ < Xr^ where 1 < n < r 2 • • • < < n}, 

and the analog of ([2]) is given by 

s(n) = supE[L^]. 

T 

It was proved by Bruss and Robertson (1991) that for s'(n) one has a uniform 
upper bound 

(6) s(n) < for all n > 1, 

so, by comparison with (|3]), we see there is a sense in which sequential selection 
of a monotone subsequence from a permutation is easier than sequential selection 
from an independent sequence. In part, this is intuitive; each successive observation 
from a permutation gives useful information about the subsequent values that can 
be observed. By (|3]) one quantifies how much this information helps. 

Since holds for all n and since is only asymptotic, it also seems natural 
to ask if there is a relation between s(n) and s{n) that is valid for all n. There is 
such a relation if one gives up the logarithmic gap. 

Theorem 2 (Selection for Random Permutations vs Random Sequences). One has 
for all n = 1,2,... that 

's{n) < s{n). 

Here we should also note that much more is known about s'(n) than just ® ; in 
particular, there are several further connections between s(n) and 's(n). These are 
taken up in a later section, but first it will be useful to give the proofs of Theorems 

[Hand [21 

The proof of Theorem[I]takes most of our effort, and it is given over the next few 
sections. Section [21 develops the basic recurrence relations, and Section [31 develops 
stability relations for these recursions. In Section[Tlwe then do the calculations that 
support a candidate for the asymptotic approximation of s(n), and we compete the 
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proof of Theorem [T] Our arguments conclude in Section [5] with the brief — and 
almost computation free — proof of Theorem [2l Finally, in Section El we discuss 
further relations between s(n), s(n), and some other closely related quantities that 
motivate two open problems. 


2. Recurrence Relations 


One can get a recurrence relation for s{n) by first step analysis. Specifically, we 
take a random permutation tt : [1 : n + 1] —>■ [1 : n + 1], and we consider its initial 
value 7r[l] = /c. If we reject 7r[l] as an element of our subsequence, we are faced with 
the problem of sequential selection from the reduced random permutation tt' on an 
n-element set. Alternatively, if we choose 7r[l] = fc as an element of our subsequence, 
we are then faced with the problem of sequential selection for a reduced random 
permutation tt" of the set {k + l,k + 2,... ,n + 1} that has n + 1 — k elements. By 
taking the better of these two possibilities, we get from the uniform distribution of 
71 [1] that 

^ n+1 

(7) s(n + 1) =-max{s(n), 1 + s(n + 1 —/c)}. 

n+1 

k=l 

From the definition ([2]) of s{n) one has s(l) = 1, so subsequent values can then be 
computed by ©• For illustration and for later discussion, we note that one has the 
approximate values: 


77 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

s(n) 

1 

1.5 

2 

2.375 

2.725 

3.046 

3.333 

3.601 

3.857 

4.098 


1.414 

2 

2.449 

2.828 

3.162 

3.464 

3.742 

4 

4.243 

4.472. 


Here we observe that for the 10 values in the table one has s{n) < and, in 
fact, this relation persists for all 1 < n < 174. Nevertheless, for n = 175 one has 
< s(n), just as O requires for all sufficiently large values of n. 


We also know from @ that the map n i-A s(n) is strictly monotone increasing, 
and, as a consequence, the recursion © can be written a bit more simply as 


( 8 ) 


s{n + 1) 


1 

- max 

77+1 l<fc<n 


{n — k-\- 1 ) 5 ( 77 ) 


n 

E + 


1 

- max 

77+1 l<A:<n 


(77 


k + l)s(n) + k + 


E 




In essence, this recursion goes back to Baer and Brock (1968, p. 408), and it is the 
basis of most of our analysis. 


3. Comparison Principles 

Given a map g : N —>■ M and 1 < fc < n, it will be convenient to set 

n 

(9) H{n,k,g) = k + {n-k + l)g{n)+ E 

i—n—k-\-l 

so the optimality recursion © can be written more succinctly as 

(10) s(n + 1) = —max H{n,k,s). 

^ ^ ^ ^ n+1 l<k<n ^ ^ 
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The next two lemmas make rigorous the idea that if g is almost a solution of (HOD 
for all n, then gin) is close to s{n) for all n. 

Lemma 3 (Upper Comparison). //15 : N —>■ K+, 1 < (/(I) + 5(1), and 

(11) -- max Hin, k,g) < gin + 1) + Sin + 1) for all n> 1, 

n + 1 l<k<n 

then one has 

n 

(12) s(n) < g{n) + 5(i) for all n > 1. 

Proof. Weset A(z) =(5(1)+5(2)H-and we argue by induction. Specifically, 

using dUD for 1 < * < n we have 

71 

H{n,k^ s) = k-\-in — kl)s{n) + s{i) 

n 

< fc + (n - fc + l)(g(n) + A(n)) + {g(i) + A(i)} 

so by monotonicity of A(-) we have 

—f-Hin,k,s) < —f-H{n,k,g) + A(n). 

n + 1 n + 1 

Now, when we take the maximum over A: £ [1 : n], the recursion m and the 
induction condition (ED, give us 

s(n + 1) < —- max H(n, k,g) + A(n) 

n + 1 l<fe<n 

< gin + 1) + 5(n + 1) + A(n) = gin + 1) + A(n + 1), 

so induction establishes (ED for all n > 1. □ 

Naturally, there is a lower bound comparison principle that parallels Lemma [31 
The statement has several moving parts, so we frame it as a separate lemma even 
though its proof can be safely omitted. 

Lemma 4 (Lower Comparison). // 5 : N —^ K'*', gil) — 5(1) < 1, and 
gin + 1) — 5(n + 1) < - max iL(n, fc, g) for all n > 0, 

n +1 l<k<n 

then one has 

71 

9{n) - E 5(*) < s(n) for all n > 1. 

i=l 

4. An Approximation Solution 
We now argue that the function / : N —^ R defined by 

(13) /(n) = \/^+bogn, 

6 

gives one an approximate solution of the recurrence equation ® for n i—>■ s{n). 
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Proposition 5. There is a constant 0 < B < oo such that for all n > 1, one has 

(14) — < -I max H(n, k, f) \ — f(n + 1) < Bn~^^^. 

n +1 (l<fc<n J 

First Step: Localization of the Maximum 

To deal with the maximum in (1141) . we first estimate 
k*{n) = locmaxfe H{n,k,f). 

From the definition (|9]) of H{n, k, f) we find 

H{n, k + l,f)- H{n,k,f) = 1- f{n) + f{n - k), 

and, from the definition 0 of /, we see this difference is monotone decreasing 
function of k; accordingly, we also have the representation 

(15) k*{n) = 1 + max{/c : 0 < 1 — /(n) + /(n — k)}. 

Now, for each n = 1, 2,... we then consider the function Dn : [0, n] —>■ R defined by 
setting 

Dn{x) = 1 - /(n) + f{n - x) = 1- {V^ - v^2(n - a;)} - i{logn - log(n - a;)}. 

This function is strictly decreasing with 0^(0) = 1 and Dn{n) = —oo, so there is 
a unique solution of the equation Dn{x) = 0. For x G [0,n) we also have the easy 
bound 

1 /■^" 11 X 

Dn{x) = 1- - du- - log(n/(n - a;)) < 1-=. 

2 J2(n-x) yu 6 V2n 

This gives us Dn{V^) < 0, so by monotonicity we have Xn < V^. 

To refine this bound to an asymptotic estimate, we start with he equation 
Dn{xn) = 0 and we apply Taylor expansions to get 

1 = |l - (1 - a:„/n)^/^| - i log(l - Xn/n) 

= (1^ + 0(a;^/p.^)| + 0{xnln). 

By simplification, we then get 

(16) = Xn + 0{x\ln) + 0(a:„/n^/^) = Xn + 0(1), 

where in the last step we used our first bound a;„ < 

Finally, by (USD and the characterization (HU, we immediately find the estimate 
that we need for k*{n). 

Lemma 6. There is a constant A > 0 such that for all n > 1, we have 

(17) A<k*{n) +A. 


Remark 7. The relations (|16l) and (flTl) can be sharpened. Specifically, if we use a 
two-term Taylor series with integral remainders, then one can show \/2n— 2 < a;„. 
Since we already know that Xn < we then see from the characterization (na 

and integrality of k*(n) that we can take A = 2 in Lemma [6l This refinement does 
not lead to a meaningful improvement in Theorem [I] so we omit the details of the 
expansions with remainders. 
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Completion of Proof of Proposition [5] 


To prove PropositionjSl we first note that the definition (jH]) of H{n, k, /) one has 
for all 1 < A: < n that 


(18) 


H{n,k,f) = f(n) + 


n + 1 


n + 1 


' k-l 'j 

- f{n-i)) \ 

. i=l J 


The task is to estimate the right-hand side of (1181) when k = k*{n) and k*{n) is 
given by (nsi). 

For the moment, we assume that one has k < Dy/n where > 0 is constant. 
With this assumption, we find that after making Taylor expansions we get from 
explicit summations that 


k-l 


k-l 


k-l 




i=l 


/logn log(n — i) 
V“6 


6 


fc—1 , . .2 

W I * * 


+ 0 


7,5/2 


k-l 




(19) = 


ik-l)k (fc-l)fc(2fc-l) {k-l)k 


2\f2in 


2^n\f2iri 


Yin 


0(. 


,-1/2 


where the implied constant of the remainder term depends only on D. 

We now define r(n) by the relation k*{n) = y/^ + r{n), and we note by (HID 
that |r'(n)| < A. Direct algebraic expansions then give us the elementary estimates 


{k*{n) - l)k*{n) 1 


1277 


= j + o0 


,-1/2 


and 


(fc*(T7) - l)fc*(77)(2fc*(77) - 1) _ 1 , _l/2^ 

2477^^ 6^ ^ 


where in each case the implied constant depends only on A. 

Estimation of the first summand of m is slightly more delicate than this since 
we need to account for the dependence of this term on r(n); specifically we have 

{k*{n) — l)k*{n) (v^ -b r{n) — l) (v^ -|- r{n)) 

2yfl^ 2-\/2n 

= yfr^l -b r(77) - ^ + 0(77“^/^). 

Now, for a pleasing surprise, we note from the last estimate and from the definition 
of k*(n) and r{n) that we have cancelation of r{n) when we then compute the 
critical sum; thus, one has simply 


fe*(n) —1 


( 20 ) 


k*{n)- Y (/W-/(ii-*)) = vW2+g+0(77 ^/2). 


2 = 1 


Finally, from the formula (fTH)) for /(•), we have the Taylor expansion 

(21) /(77 + 1)-/(77) = ^ + T+0(„-3/2)^ 

V 2n on 
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SO, when we return to the identity (fT 8 |l . we see that the estimates ([20|l and CID 
give us the estimate 

- /(^ + 1 ) 

77+1 l<k<n j 

= ^ + f{n) - f{n + 1 ) = 0 (n"^/^). 

Here the implied constant is absolute, and the proof of Proposition [5] is complete. 
Completion of Proof of Theorem [T] 

Lemmas |3] and [4] combine with Proposition [5] to tell us that by summing the 
sequence n = 1 , 2 ,... and by writing C,{z) = 1 + 2 “^ + 3“^ + • • • one has 

|s(n) — f{n)\ < C(3/2)H < (2.62)i? for all n > 1. 

This is slightly more than one needs to complete the proof of Theorem [T] 

5. Proof of Theorem [5] 

The sequential monotone selection problem is a finite horizon Markov decision 
problem with bounded rewards and finite action space, and for such problems it is 
known one cannot improve upon an optimal deterministic strategy by the use of 
strategies that incorporate randomization, (cf. Bertsekas and Shreve, 1978, Corol¬ 
lary 8.5.1). The proof Theorem [5] exploits this observation by constructing a ran¬ 
domized algorithm for the sequential selection of a monotone subsequence from a 
random permutation. 

We first recall that ifci, i = l,2,...,n-|-l are independent exponentially dis¬ 
tributed random variables with mean 1 and if one sets 

Y _ El + £2 + ■ ■ ■ + Gi 

Cl -|- 62 -f • • • -f e„+l 

then the vector (Yi,Y 2 ,... ,Y„) has the same distribution as the vector of order 
statistics X( 2 ), ■.., -^(n)) of an i.i.d. sample of size n from the uniform distri¬ 

bution. Next we let A denote an optimal algorithm for sequential selection from an 
independent sample Xi,X 2 ,..., X^ from the uniform distribution, and we let t{A) 
denote the associated sequence of stopping times. If denotes the length of the 
subsequence that is chosen from from Xi, X 2 ,..., Xn when one follows the strategy 
t{A) determined by A^ then by optimality of A for selection from Xi, X 2 ,..., Xn 
we have 

s{n) = supE[L;;] = 

T 

We use the algorithm A to construct a new randomized algorithm A' for se¬ 
quential selection of an increasing from a random permutation tt : [n] 1 —^ [n]. 
First, the decision maker generates independent exponential random variables e^, 
f = 1, 2,..., n -|-1 as above. This is done off-line, and this step can be viewed as an 
internal randomization. 

Now, for i = 1, 2,..., n, when we are presented with 7r[f] at time f, we compute 
Xi = Finally, if at time i the value Xi would be accepted by the algorithm 

A, then the algorithm A' accepts 7r[i]. Otherwise the newly observed value 7r[i] is 
rejected. By our construction we have 

( 22 ) EK(^')]=EK(^)] = s(n). 
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Moreover, A' is a randomized algorithm for construction an increasing subse¬ 
quence of a random permutation tt. By definition, s(n) is the expected length of 
a monotone subsequence selected from a random permutation by an optimal de¬ 
terministic algorithm, and by our earlier observation, the randomized algorithm A' 
cannot do better. Thus, from (|2^ one has s(n) < s(n), and the proof of Theorem 
[T]is complete. 


6. Further Connections and Considerations 
As we noted before, Bruss and Robertson (1991) discovered the uniform bound 

(23) s'(n) < \/2n for all n > 1, 

and their proof depended on a general bound for the expected value of the partial 
sums of the smallest order statistics of a uniformly distributed random sample. 
Gnedin (1999) later gave a much different proof of (^5)) and generalized the bound 
in a way that accommodate random samples with random sizes. More recently, 
Arlotto, Mossel and Steele (2015) obtained yet another proof (1231) as a corollary to 
bounds on the quickest selection problem^ which is an informal dual to the tradi¬ 
tional selection problem. 

Since the bound (12311 is now well understood from several points of view, it is 
reasonable to ask about the possibility of some corresponding uniform bound on 
s(n). The numerical values that we noted after the recursion (1231) and the relation 

(24) s(n) = + ^\ogn + 0(1) 

6 

from Theorem [T] both tell us that one cannot expect a uniform bound for s{n) that 
is as simple as that for 's(n) given by (1231) . Nevertheless, numerical evidence suggest 
that the 0(1) term in (l24l) is always negative. The tools used here cannot confirm 
this conjecture, but the multiple perspectives available for (1^ give one hope. 

A closely related issue arises for s'(n) when one considers lower bounds. Here the 
first steps were taken by Bruss and Delbaen (2001) who considered i.i.d. samples of 
size JV^ where JV^ is an independent random variable with the Poisson distribution 
with mean u. In the natural (but slightly overburdened) notation, they proved that 
there is a constant c > 0 such that 

\P 2 I 1 — clogu < s(u); 

moreover, Bruss and Delbaen (2004) subsequently proved that for the optimal fea¬ 
sible strategy r* = (ri, r 2 ,...) the random variable 

= max{fc : < Xr^ < • • • < Xr^, where 1 < n < T 2 • • • < ta, < N^}, 

also satisfies a central limit theorem. Arlotto, Nguyen and Steele (2015) considered 
the de-Poissonization of these results, and it was found that one has the corre¬ 
sponding CLT for where the sample size n is deterministic. In particular, one 
has the bounds 

\/^ — clog n < 'sin) < 

Now, by analogy with (l24l) . one strongly expects that there is a constant c > 0 such 
that 


(25) 


s(n) = — clogn -I- 0(1). 
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Still, a proof this conjecture is reasonably remote, since, for the moment, there is 
not even a compelling candidate for the value of c. 

For a second point of comparison, one can recall the non-sequential selection 
problem where one studies 

£{n) = E[max{fc : < ... < 1 < n < *2 < • • • < 4 < n}]. 

Through a long sequence of investigations culminating with Baik, Deift and Jo¬ 
hansson (1999), it is now known that one has 



(26) 


where the constant a = 1.77108... is determined numerically in terms of solutions 
of a Painleve equation of type II. Romik (2014) gives an elegant account of the 
extensive technology behind (1^51) . and there are interesting analogies between i(n) 
and s{n). Nevertheless, a proof of the conjecture (|2^ seems much more likely to 
come from direct methods like those used here to prove (IMl) . 

Finally, one should note that the asymptotic formulas for n i—>■ £(n), n i—>■ s(n), 
and n i—>■ s'(n) all suggest that these maps are concave, but so far only n i—>■ s(n) 
has been proved to be concave (cf. Arlotto, Nguyen and Steele (2015, p. 3604)). 
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